CDS

Accession Number TCMCG042C56032
gbkey CDS
Protein Id XP_016488168.1
Location join(2006..2104,2793..2891,3005..3067,4702..4779,5257..5431,5732..5991,6091..6448,7569..7646,8734..8987,9069..9125,9267..9575,9655..9833,9952..10234,11182..11409,13966..14127,15309..15452,15530..15745,16743..16847,16933..17034,17722..17886)
Gene LOC107808192
GeneID 107808192
Organism Nicotiana tabacum

Protein

Length 1137aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016632682.1
Definition PREDICTED: protein ALWAYS EARLY 3-like [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category BDT
Description SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K21773        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04218        [VIEW IN KEGG]
map04218        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTCCAGCAAGAAAATCTAGAAGTGTAAATAAGCGGTTTTCTCCTACAACGGAAATCTCTCCCAGTAAAGATGACAGTGCGAAGAAAAACTTGCGGAAGAGGAAGTTGTCCGACATGCTTGGTCCTGAGTGGAGTGAGGAAGATCTGACACGCTTCTATCAAGCATACCGCAAGTATGGTAAAGATTGGAAAAAGGTTGCTGCTGCAGTGAAACCCCGAACTTCAGAAATGGTGGAAGCTCTTTACATGATGAATAGGGCTTATTTATCTCTTCCAGAGGGGACCGCATCTGTGGTTGGGCTGATTGCCATGATGACTGATCACTACTGCAACTTGGCAGCAAGCGACAGTGAGCAAGAAAGTAATGAGGATGCTGGAACGTCTCGAAAACCTCAAAAACGTGCTCGGGGTAAAGTTCAGTCTAATATTTCTAAAGCATATGAGATGACATCTCCGACATTAGCAGCTAGTCATGGTTGTTTAACTTTGTTGAAGAAGAAGCGTTCAGGAGGAAGCCGGCCTCGTGCTGTTGGGAAAAGAACTCCACGCTTTCCTGTTTCTTTTTCTTGTGGAAATCCCAAGGGTGAGAAGTATTTTTCTCCTAGTAGGCAGAGTTTGAAACTACAGGCAGATGACACTGATGATGATGTGAAGATAGCATTAGTTTTAACAGAAGCTTCACAAAGAGGTGGCTCTCCTCAGGTCTCTCAGACACCGAACCGTAGGACGGACAGCGCTATGTCCTCACCTGTTGAGACAGCTGAAAGAAAGCATGTTAAAATAGGTATGGGAAATGCCAAGCTTCTTAGTAATGAAGTGGACGAAGAAGAGGGAAGCATGGAAGCGGACACTGGAGAGCTTTTGCGGTATAAGAATGATTCAGTGGAAACTGGAACCTTTGGTCGAACAGCACAGAAGGGAAGAAGACCTTATGGTAAAAAGTTGGAAATTGATGATAGTGGAGCCAATCATTTTGATGATATCAAAGAGGCATGTAGTGGTACAGAAGAAGGTCAAATATTGGGTGTGGTGAGGGGTAAACTTGAAATGGAGGCCACATATGAAAAGAATTCAAGGACCTCTTTACAAGGCCCTAGAAAGAGGAGCAAAAAAGTTCTTTTCAGCAGAGACGAAAGCTCCGCTTTTGATGCTCTACAAACGTTGGCTGATTTGTCTCTGATGATGCCAACAGCAGAAAATGAAGATGAGTCCATGATCCAGTTCAATGATGAACTTGATGATCATGTTGATGAATCTGGCTCCTTGGAGGCCGTGCCTGCAAACAGACATAGAGATAAACGTGGATCTGGGGGGGTTAGATCTAGATGGAGTCAACCTTTATCAAAGTTTGAAGTTGCTTCCACTACAAAATCAAAGCATGGTAAAGTTACATCTACTGATGTTAGTGCTGTTCCTGAAACAAAGCAGGCGAGGAGGGCACATAAAGCAATGTCGTCTAAGGCTCGAAAAACTGAAGGTCATGTTAACAATAATGTTGCTGGATCCGAGGAAGCTGAGGCAAAAGAAGCATCAAAGAAGTCAACCTATAAGGGTAAAAGATCCTATCAGAGTGCATCCCCGAAATTAATCAAAGATCAAGAGCCTTCATCATGTGCAGATCCAAGAACAGAACGAAGTGATTCAGCTCAATCAACTGCGGAGATCCCTGTGGCAAACCAGGTTAACTTACCTACTAAAGTCAGAAGCAGGCGTAAGATGGACCTGAAAAAACCTCAAAGACAAAAAGATTTGAAAATTCCTGATAAAAGTTTGGATGATACTAGTGCATCCTTCACTGCACTTCATGACAGAGCATTCAGTCTTAAGGAAAATATTTCCAATTGCCTTTCCAACCATCAGGTACGAAGATGGTGTACGTATGAGTGGTTCTACAGTGCAATTGACTACCCTTGGTTTGCCAAAAGGGATTTTGTGGAGTACCTGAATCATGTTGGATTGGGACATGTTCCAAGGTTAACTCGTGTTGAATGGGGCGTCATAAGAAGTTCTCTTGGAAAACCACGGCGATTCTCTGAGCAATTTCTGAATGAAGAAAAGGAGAAGCTTAATCAATACCGGGAATCTGTCAGAACACATTACACTGAACTTCGTGAAGGTACCAGGGAAGGACTACCCACAGATCTTGCAAGGCCATTGTCTGTTGGGCAACGAGTCATTGCCATCCATCCAAAAACAAGAGAGATTCATGATGGAAGTGTATTGACAGTTGATCGCTCAAGATGTCGTGTTCAGTTTGACCGACCTGAGCTCGGGGTTGAATTTGTCATGGACATTGACTGCATGCCTCTAAATCCATTTGAAAACATGCCCACACTACTTACAAGGCGTGCAGATGCTGTTGACAAGTTTTTCGAGAGTTTTAATGAGCTCAAGGTGAATGCACGAGTAAATGAGTATATGAAATTTCCGGCCTGTGACAACATGGAGAATGGAAATGTTTTCTCTCATTTCTCCCCACCAAGTCATCCCATCAGTGATCTCCTAAAGCAGACAAAGGTGGCTTCAGCAGAAGTGGATATGCAATCTAGATTTGGAGTCATGGAAACTGCCATATATCAATCGACAGCATATTCAAAGTCTTCTGGGGTCGCTCAGATTCAAGCAAAGGAAGCTGATGTTCAAGCTCTTGCTGAGTTGGCTCGTGCACTAGACAAAAAGGAAGCAGTGGTTTCAGAGTTGAGGCGCATGAATGATGATGTGTTGGAAAACCAAACGAGCAATGACTGTTCTCTTAAGGACTCAGAGACTTTCAAGAAGCAATATGCTGCCATGCTAATACAGTTAAATGAGGTCAATGAGGAGGTTTCTTCTGCTCTATATCGCTTGAGACAGCGGAATACCTATCAGGGAAGCATTCCACTTGCATTCCCAAGGCCAGTTCCAAATTTTGCTGATCCTAGTACGTTGAGCACTTTTGATCGTTGTACAAGTCAGTCGCAAGAATCAGGGTTCCTTGTCAATGAGATAATAGAAAGTTCAAAAATCAAAGCCCGGACTATGGTAGATGCAGCAGTGCAGGCGATGCTTTCACTTACTGATAGAGACAACACCACTGAAAAGATTGAGGAGGCTATTGGTTATGTGAATGATCGGATTCCACTAGATGATTCTTGCATGCCAACTCAACCTACTGATCCTAAGTCAAAGAATATGTCAGATAAAAATGAAGCAGAAATTCCTTCAGAACTTATTACTAAATGCGTCGCTACTTTGCTTATGATTCAGAAGTGTTCGGAACGACAGTTCCCACCAGCTGATGTAGCAAAAGTACTGGATTCCGCTGTAGCAAGCTTACAACCCTGTTGCTCACAGAACTTTCCAATCTATGCGGAAATACAGCAGTGCATGGGAATAATCAGAAACCAAATTCTGGCATTAGTACCGATGTAG
Protein:  
MAPARKSRSVNKRFSPTTEISPSKDDSAKKNLRKRKLSDMLGPEWSEEDLTRFYQAYRKYGKDWKKVAAAVKPRTSEMVEALYMMNRAYLSLPEGTASVVGLIAMMTDHYCNLAASDSEQESNEDAGTSRKPQKRARGKVQSNISKAYEMTSPTLAASHGCLTLLKKKRSGGSRPRAVGKRTPRFPVSFSCGNPKGEKYFSPSRQSLKLQADDTDDDVKIALVLTEASQRGGSPQVSQTPNRRTDSAMSSPVETAERKHVKIGMGNAKLLSNEVDEEEGSMEADTGELLRYKNDSVETGTFGRTAQKGRRPYGKKLEIDDSGANHFDDIKEACSGTEEGQILGVVRGKLEMEATYEKNSRTSLQGPRKRSKKVLFSRDESSAFDALQTLADLSLMMPTAENEDESMIQFNDELDDHVDESGSLEAVPANRHRDKRGSGGVRSRWSQPLSKFEVASTTKSKHGKVTSTDVSAVPETKQARRAHKAMSSKARKTEGHVNNNVAGSEEAEAKEASKKSTYKGKRSYQSASPKLIKDQEPSSCADPRTERSDSAQSTAEIPVANQVNLPTKVRSRRKMDLKKPQRQKDLKIPDKSLDDTSASFTALHDRAFSLKENISNCLSNHQVRRWCTYEWFYSAIDYPWFAKRDFVEYLNHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLNEEKEKLNQYRESVRTHYTELREGTREGLPTDLARPLSVGQRVIAIHPKTREIHDGSVLTVDRSRCRVQFDRPELGVEFVMDIDCMPLNPFENMPTLLTRRADAVDKFFESFNELKVNARVNEYMKFPACDNMENGNVFSHFSPPSHPISDLLKQTKVASAEVDMQSRFGVMETAIYQSTAYSKSSGVAQIQAKEADVQALAELARALDKKEAVVSELRRMNDDVLENQTSNDCSLKDSETFKKQYAAMLIQLNEVNEEVSSALYRLRQRNTYQGSIPLAFPRPVPNFADPSTLSTFDRCTSQSQESGFLVNEIIESSKIKARTMVDAAVQAMLSLTDRDNTTEKIEEAIGYVNDRIPLDDSCMPTQPTDPKSKNMSDKNEAEIPSELITKCVATLLMIQKCSERQFPPADVAKVLDSAVASLQPCCSQNFPIYAEIQQCMGIIRNQILALVPM